feat: add NanoGPT provider and missing ZAI GLM-5 models#13
Conversation
Add NanoGPT as a new credit-based aggregator provider with 22 models spanning Anthropic, OpenAI, Google, xAI, DeepSeek, Mistral, and Moonshot. Pricing is $0.00/token as NanoGPT uses a credit-based billing model. Also add three ZAI models missing from the catalog (glm-5, glm-5-turbo, glm-5.1) to match the current Z.ai API surface. Relates to ferro-labs/ai-gateway#120. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
Thanks for this @gr3enarr0w, and for moving it over here from the gateway repo 👍 I ran the full pipeline on the branch — 1. Pricing should be
|
| field | base opus-4.7 (truth) | this PR |
|---|---|---|
| context_window | 1,000,000 | 200,000 |
| max_output_tokens | 128,000 | 16,384 |
| vision | true | false |
| reasoning | true | false |
| prompt_caching | true | false |
Consumers that filter by capability (e.g. vision/reasoning support) will wrongly exclude these. The OpenRouter aggregator entries already in the catalog get this right (real per-model capabilities) — that's the bar. The cleanest path may be to mirror the base model's capabilities for each underlying model.
3. Verify the model IDs + point source: at a real endpoint (please confirm)
source: is the homepage (https://nano-gpt.com) rather than a model-listing endpoint. Some IDs are verifiable (claude-opus-4.7), but others I can't confirm exist on NanoGPT's API (gpt-5.5, gemini-3.5-flash, grok-4.20, grok-4.3, deepseek-v4-pro/flash, kimi-k2.6). Could you confirm each against NanoGPT's /v1/models response and set source: to that endpoint?
4. Minor: display_name consistency
Mixed styles — "Gemini 3.5 Flash via NanoGPT" vs raw "claude-opus-4.7 via NanoGPT". Worth normalizing to the cased form.
Note on gateway integration (not blocking this PR)
Heads up that the catalog entry alone won't make NanoGPT routable in the AI Gateway — the gateway discovers models live from each provider's API and doesn't consume this catalog for routing. Actually calling NanoGPT would need a thin OpenAI-compatible provider in ai-gateway/providers/nanogpt/ (the openrouter.go pattern transfers cleanly). That's a separate PR; this one is fine as a data-only contribution.
The structure here is solid — it's really just the pricing-encoding and capability accuracy on the NanoGPT files. Happy to help if any of the underlying-model values are unclear.
|
@gr3enarr0w One clarification on the pricing point above, so the recommendation isn't misleading: Switching NanoGPT to So |
Summary
aggregatorcategory provider with 22 models spanning Anthropic, OpenAI, Google, xAI, DeepSeek, Mistral, and Moonshot AI. Pricing is$0.00/token— NanoGPT uses a credit-based billing model.glm-5,glm-5-turbo,glm-5.1to match the current Z.ai API surface.This contribution was redirected here from ferro-labs/ai-gateway#120 per maintainer guidance (MitulShah1) that model catalog data is moving to this dedicated repo from release 1.2.0.
Test plan
validate.yml)🤖 Generated with Claude Code